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551 
601 
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1451 
1501 
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1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
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2051 
2101 
2151 
2201 
2251 
2301 
2351 
2401 
2451 
2501 
2551 
2601 
2651 
2701 
2751 
2801 
2851 
2901 
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TTTCTTCTGT 
CCGCCCAAGC 
CATGAGTTCC 
GCGGGGAGAT 
CCCCGCGCCT 
CTAGCCCAGC 
GAGATGGAGG 
CCTGCTGCTG 
CCCCCGGGCC 
CCCGGGGCGC 
GGTGTTCACC 
GGCAGGAGGC 
GGCCGGGGAA 
TTTCTTCTCC 
TTGCTCTGCG 
CAGGCGGAGG 
CCCATTCGAT 
GCTCCCTCCT 
GCCGTGGTCC 
GGGTCAGACC 
CCCACAAGCA 
CGGCAGGTGC 
TGACCTTGTC 
CAGGCACAGA 
CTGTTTGCTG 
GCTCCTGATG 
ATCGGGAGCT 
CTCCCTTACA 
GGTGCCCATG 
GGTACACCCT 
CTGCATGACC 
TTTCCTGGAT 
TCTCCTTAGG 
CTCTTCCTCT 
GTGCCCGCCG 
ACATTCCCCC 
ACCACGCAGA 
GACGGTGCCT 
GAGTGTACAC 
AGTCTGTCCC 
CAACCGCACA 
TAGCTTTCCA 
CCAGGTAACC 
GGCTGTCACC 
CACGCCCTCT 
GGCACAGGGA 
CGCTGTCTAT 
CCAGCCTGTG 
GCTACCCACG 
GACACGCCGC 
GCCCTCAGTC 
ATCTGGGTCT 
GACCGGAGGA 
GAAAGCGGCC 
ACAGCCCCTG 
AGAGACCAAG 
GGATCTGACG 
GTTTCTGGCC 
AAAAAAAAAA 



TTGCTTACTC 
CCCGCGTAAA 
CGACATCAGG 
AAGCCTGCCC 
GACTTCGTTG 
CGCGCGGAGC 
CGACCGGCAC 
ACGCTGGCGC 
CACGCCGCTA 
TGTATTCAGG 
ATCTACCTGG 
TGTGCGGGAG 
CCGTAGCGAT 
AACGGGGAGC 
GGACCTGGGC 
CCCGGTGTCT 
CCCTCCCTGC 
CTTTGGCCTC 
GGGCAGCTGG 
TACGAGATGT 
GCTCCTCCAC 
AGCAGCACCA 
GATGCCTTCC 
ATTCACCAAC 
GGACGATGAC 
AAATACCCTC 
GGGGGCTGGC 
CCGACGCGGT 
GGAATACCCC 
GCCCCAGGGC 
CCAACATCTT 
GCAGATGGAC 
GAAGCGTGTC 
TCTTCACCAC 
GACACCCTGA 
AGCCTTCCAG 
CCAGATGAAG 
CCAGCCTCAA 
TGCAGGCAGC 
ACGGCCCACA 
CCCATACACA 
CAGACATAAA 
CACCAACTCC 
TTCACAAGCC 
CCATTCATCG 
ACAGCATGCC 
GGCCCCAACT 
ACCACCGATG 
TACGACATCG 
CCCCACAGAG 
ACCCC I I I I I 
GCGATTATGC 
ACACCTGCCC 
CCTGCTGCCC 
TTCGGCGTCA 
TAGGGGCCAG 
TTCCTTGTGA 
TTTAGCCTAA 
AAAAAAAAAA 



CCTATCCGGG 
CCTGGGTGAC 
CGGCGGCGGT 
AGGAGGCAGG 
GGGAGGGAGA 
GCCTGGGAGA 
CTGGGCGCTG 
TGTCCGGGAC 
CCACTGCTGG 
GCTCATGCGG 
GACCCTGGCG 
GCCCTGGGAG 
GCTGGAAGGG 
GGTGGAGGCA 
ATGGGGAAGC 
GGTGGAGACA 
TGCTGGCCCA 
CGCTTCTCCT 
TGGTACCCTG 
TCTCCTGGTT 
CACGTCAGCA 
GGGGAACCTG 
TGCTGAAGAT 
AAGAACATGC 
GGTCAGCACC 
ATGTCCAAAA 
CAGGCACCAA 
TCTGCATGAG 
GCACCCTCAT 
ACGGAGGTCT 
CAAGCACCCA 
GGTTCAGGAA 
TGCCTTGGAG 
CATCCTACAA 
GCCTCAAGCC 
CTGCAAGTCC 
GAAGGCAACT 
CAGTGGGCAT 
CACATTTACA 
CGCTCACTTG 
ACTACAAGGG 
TATAGTCCAT 
CCTGGATCTG 
ACAGAAACGG 
AACTTCTCAG 
CCCTCCGGGG 
CATGCTCCCT 
TCCACACACC 
TCCTGGCTCC 
GCACAGTCCC 
AAGCACCCTG 
ACAGAGACTT 
AACCCCAACA 
CTCCACACAC 
GAGTCCCCAC 
TTTCCAATTC 
CTTAAGGGTC 
AAAAAAAAAA 
AAAAAAAAAA 



GGCCCAAGGC 
CTCGGAGACA 
GGTCCGGGAG 
GGGCTGGGCT 
CGCCCGGCTC 
GGAGAAGGAG 
CTGCTGGCGC 
CAGGGCCCGA 
GAAACCTCCT 
CTGAGTAAGA 
GCCTGTGGTG 
GTCAGGCTGA 
ACTTTTGATG 
GCTGAGGAAG 
GAGAAGGCGA 
TTCCAGGGGA 
GGCCACCTCC 
ATGAGGATAA 
CTGGGAGTCA 
CCTGCGGCCC 
CCTTGGCTGC 
GATGCTTCGG 
GGCACAGGAG 
TGATGACAGT 
ACGGTCGGCT 
GTGGGTACGT 
GCCTAGGGGA 
GCGCAGCGGC 
GCGGACCACC 
TCCCCCTCCT 
GAAGAGTTCA 
GCATGAGGCG 
AGGGCCTGGC 
GCCTTCTCCC 
CACCGTCAGT 
GTCCCACTGA 
TGGAAGTGGT 
GGACAGGGTT 
CGCCTGCAGT 
ACTCATGCTG 
CCACAAAGCA 
CTGCAATCAC 
CAGCCCACAC 
CCACACATGT 
TGTCCCTGTC 
TCATGCCACC 
CTCTTGGCTA 
CCCAACCACT 
CCAGAGTATC 
CAGCCACCTC 
ATTCTACCAA 
TGGACATACG 
CGTGCTTATG 
ACATACACAC 
TAGACCCAGT 
ACCCTGTCAG 
CGGCTTGGGA 
AAAAAAAAAA 
AAAAAAAAAA 



GCTGTCTCCG 
TCCGTTGGAG 
AAACCCGGCG 
AGCTGCCCCG 
CCGCCCCTAA 
CCGACCTGCC 
TGGCGCTGCT 
GGCCACCTGC 
GCAGCTACGG 
AGTACGGACC 
GTCCTGGTTG 
GGAGTTCAGC 
GCCATGGGGT 
TTTACCATGC 
GGAGCTGATC 
CAGAAGGACG 
AACGTAGTCT 
GGAGTTCCAG 
GCTCCCAGGG 
CTGCCAGGCC 
CTTCACAGTC 
GCCCCGCACG 
GAACAAAACC 
CATTTATTTG 
ATACCCTCCT 
GAGGAGCTGA 
CCGTACCCGC 
TGCTGGCGCT 
CGCTTCCGAG 
TGGCTCCATC 
ACCCAGACCG 
TTCCTGCCCT 
AAAAGCGGAG 
TGGAGAGCCC 
GGCCTTTTCA 
CCTTCACTCC 
GGGTGCCCAG 
AATGTCTCCA 
TGTTTTCCGG 
CTAAGATGCA 
ACTGCTGGGT 
AAGCACATAG 
GTGGGAGTCT 
TCACAGCTCA 
CCTGGTGCCT 
CAGAGACTGT 
CACCACTCTC 
TGTCCACACA 
TTCCCACTGA 
TGCAACTGCA 
ATGCAAACAC 
AGGACCCTCA 
TAACCACGTG 
TCACTGATCT 
GGAAGGGGTT 
GGAGTGAGCC 
ATTAAAGTTT 
AAAAAAAAAA 
AAAA (SEQ ID NO:l) 



FEATURES: 
5'UTR: 1-303 
Start Codon:304 
Stop Codon: 1815 
3'utr: 1818 
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IOMOLOGOUS PROTEINS: 



9] 
9] 
9] 
91 
9] 
9] 
9] 
9] 
91 
9T 



117254 
404777 
479930 
117212 
320075 



SD P24461|CPG1 W .RABIT CYTOCHROME P450 2Gl (CYPIIGl) (P 
gb | AAA31432.il (L10912) cytochrome P-450 2B-Bx [Oryct 
pi T ' I5?5§? 6 cytochrome P450 2B4 isoform Bx - rabbit > 
sp|P00178|CPB4_RABIT CYTOCHROME P450 2B4 (CYPIIB4) (P 
^^^PVM 531277 cytochrome P450 2B4-B1 - rabbit >qi 121369 
^^3X350 1 I . i J (AF128849) cytochrome P450 2bI0 relat 
1 3 ^3191|sp|P04167|CPB2_RAT CYTOCHROME P450 2B2 (CYPIIB2) (P4 



117213 | sp 
89973 | pi r 
2144292 | p 



P12789|CPB5_RABIT CYTOCHROME P450 2B5 (CYPIIB5) (P 
IA27717 cytochrome P450 2B5, hepatic (form HPl) - 
r||04RTP2 cytochrome P450 2B2 - rat 



ESt: 

g6TBEl48597 
' BF359243 
AW753778 



gb 



BE148597 MR0-HT0241-150500-010-b02 HT0241 Homo sapi , 
BF359243 RC6-ET0072-150600-011-F01 ET0072 Homo sapi. 
AW753778 RCl-CT0286-301099-011-f08 CT0286 Homo sapi. 



EXPRESSION INFORMATION FOR MODULATORY USE" 
gb BE148597 heacLneck 
gb BF359243 lung_tumor 
gb AW753778 colon 
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1 MEATGTWALL LALALLLLLT LALSGTRARG HLPPGPTPLP LLGNLLQLRP 

51 GALYSGLMRL SKKYGPVFTI YLGPWRPWV LVGQEAVREA LGGQAEEFSG 

101 RGTVAMLEGT FDGHGVFFSN GERWRQLRKF TMLALRDLGM GKREGEELIQ 

151 AEARCLVETF QGTEGRPFDP SLLLAQATSN WCSLLFGLR FSYEDKEFQA 

201 WRAAGGTLL GVSSQGGQTY EMFSWFLRPL PGPHKQLLHH VSTLAAFTVR 

251 QVQQHQGNLD ASGPARDLVD AFLLKMAQEE QNPGTEFTNK NMLMTVIYLL 

301 FAGTMTVSTT VGYTLLLLMK YPHVQKWVRE ELNRELGAGQ APSLGDRTRL 

351 PYTDAVLHEA QRLLALVPMG IPRTLMRTTR FRGYTLPQGT EVFPLLGSIL 

401 HDPNIFKHPE EFNPDRFLDA DGRFRKHEAF LPFSLGKRVC LGEGLAKAEL 

451 FLFFTTILQA FSLESPCPPD TLSLKPTVSG LFNIPPAFQL QVRPTDLHST 

501 TQTR (SEQ ID NO: 2) 



FEATURES: 

Functional domains and key regions: 
Prosite search results: 

[1] PDOC00004 PS00004 CAMP_PHOSPHO_SITE 

CAMP- and cGMP-dependent protein kinase phosphorylation site 

128-131 RKFT 

[2] PDOC00005 P500005 PKC_PHOSPHO_SITE 
Protein kinase c phosphorylation site 

Number of matches: 6 

1 61-63 SKK 

2 99-101 SGR 

3 248-250 TVR 

4 288-290 TNK 

5 378-380 TTR 

6 473-475 SLK 

[3] PDOC00006 PS00006 CK2_PHOSPHO_SITE 
Casein kinase II phosphorylation site 

Number of matches: 3 

1 119-122 SNGE 

2 192-195 SYED 

3 343-346 SLGD 

[4] PDOC00008 PS00008 MYRISTYL 

N-myristoylation site 

Number of matches: 10 

1 51-56 GALYSG 

2 109-114 GTFDGH 

3 115-120 GVFFSN 

4 188-193 GLRFSY 

5 207-212 GTLLGV 

6 257-262 GNLDAS 

7 284-289 GTEFTN 

8 339-344 GQAPSL 

9 370-375 GIPRTL 
10 444-449 GLAKAE 

[5] PDOC00009 PS00009 AMIDATION 

Amidation site 

Number of matches: 2 

1 140-143 MGKR 

2 435-438 LGKR 

[6] PDOC00029 PS00029 LEUCINE_2IPPER 
Leucine zipper pattern 

Number of matches: 2 

1 32-53 LPPGPTPLPLLGNLLQLRPGAL 

2 39-60 LPLLGNLLQLRPGALYSGLMRL 
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[7] PDOC00081 PS00086 CYTOCHROMES 50 

Cytochrome P450 cysteine heme-iron ligand signature 

433-442 FSLGKRVCLG 

Membrane spannin g structure and domain*;; 
Helix Begin End Score Certainty 

1 5 25 2.305 Certain 

2 64 84 0.783 Putative 

3 170 190 1.041 Certain 

4 292 312 2.031 Certain 

5 448 468 1.133 Certain 
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LAST Alignment to Top Hit: 
>gi |117254|sp|P24461|CPGl_RABIT CYTOCHROME P450 2Gl (CYPIIGl) 
. (P450-NMB) (OLFACTIVE) 

pir||B31944 cytochrome P450 2G1 - rabbit 
Length = 494 

Score = 516 bits (1315), Expect = e-145 

Identities = 248/491 (50%), Positives = 345/491 (69%), Gaps = 3/491 (0%) 
Query: l meatgtwalllalal-lllltlalsgtrarghlppgptplpllgnllqlrpgalysglmr 59 

c-u- * n ME G + + LAL LL+ +A + G LPPGPTP+P LGNLLQ+R A + ++ 
SbjCt: 1 MELGGAFTIFLALCFSCLLILIAWKRVQKPGRLPPGPTPIPFLGNLLQVRTDATFQSFLK 60 

Query: 60 lskkygpvftiylgpwrpwvlvgqeavrealggqaeefsgrgtvamlegtfdghgvffs 119 

c .. C1 L +KYGPVFT+Y+GP RPW+L G EAV+EAL +A+EFSGRG +A +E F GHGV + 
sbjct: 61 LREKYGPVFTVYMGP-RPWILCGHEAVKEALVDRADEFSGRGELASVERNFQGHGVALA 119 

Query: 120 NGERWRQLRKFTMLALRDLGMGKREGEELIQAEARCLVETFQGTEGRPFDPSLLLAQATS 179 
,-u- NGERWR LR+F++ LRD GMGKR EE IQ EA L+E F+ T+G P DP+ L++ S 

SbjCt: 120 NGERWRILRRFSLTILRDFGMGKRSIEERIQEEAGYLLEEFRKTKGAPIDPTFFLSRTVS 179 

Query: 180 nwcsllfglrfsyedkefqawraaggtllgvssqggqtyemfswflrplpgphkqllh 239 

NV+ S++FG RF YEDK+F +++R + + +S+ Q Y+M+S ++ LPG H ++ + 
Sb]Ct: 180 NVISSWFGSRFDYEDKQFLSLLRMINESFIEMSTPWAQLYDMYSGVMQYLPGRHNRIYY 239 

Query: 240 hvstlaaftvrqvqqhqgnldasgpardlvdafllkmaqeeqnpgteftnknmlmtviyl 299 

rU . + L F +v + ++ +LD P RD +D FL+KM Q++ NP TEF KN+++T + L 

SbjCt: 240 LIEELKDFIAARVKVNEASLDPQNP-RDFIDCFLIKMHQDKNNPHTEFNLKNLVLTTLNL 298 

Query: 300 LFAGTMTVSTTVGYTLLLLMKYPHVQKWVREELNRELGAGQAPSLGDRTRLPYTDAVLHE 359 
cU . ^ FAGT TVS+T+ Y LL+MK+P VQ + EE+N+ +G + PS+ DR ++P+TDAV+HE 

SbjCt: 299 FFAGTETVSSTLRYGFLLIMKHPEVQTKIYEEINQVIGPHRIPSVDDRVKMPFTDAVIHE 358 

Query: 360 aqrllalvpmgiprtlmrttrfrgytlpqgtevfpllgsilhdpnifkhpeefnpdrfld 419 

c-u- + , frt QRL +VPM6+P ++R T FRGY LP+GT+VFPLLGS+L DP F HP++F P FLD 
SbjCt: 359 IQRLTDIVPMGVPHNVIRDTHFRGYLLPKGTDVFPLLGSVLKDPKYFCHPDDFYPQHFLD 418 

Query: 420 ADGRFRKHEAFLPFSLGKRVCLGEGLAKAELFLFFTTILQAFSLESPCPPDTLSLKPTVS 479 
ru- *m GRF+K+EAF+PFS GKR+CLGE +A+ ELFL+FT+ILQ FSL PP + + P +S 
SbjCt: 419 EQGRFKKNEAFVPFSSGKRICLGEAMARMELFLYFTSILQNFSLHPLVPPVNIDITPKIS 478 

Query: 480 GLFNIPPAFQL 490 

G NI PP ++L 
SbjCt: 479 GFGNIPPTYEL 489 (SEQ ID NO: 4) 

Hmmer search results (Pfam): 

Scores for sequence family classification (score includes all domains): 

^notion Score E-value N 

PF00067 Cytochrome P450 594~4 6^96-175 ~1 

Parsed for domains: 

Model Domain^ seq-f seq-t hmm-f hmm-t score E-value 

PF00067 1/1 33 493 . . ~~"l ~497 [] 594~4 6~9e~175 
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1 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
51 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1901 NNNNNNNNNN NNNNNNNNNN NNNNNTGACA GGGGCCATGA TGGAGACACC 
1253- T^GATCGAA GAGGTCACAG CACCCTCCTC TTTCTTCCTC CCTACCCCCA 
2001 GCTGAGTAAG AAGTACGGAC CGGTGTTCAC CATCTACCTG GGACCCTGGC 
2051 GGCCTGTGGT GGTCCTGGTT GGGCAGGAGG CTGTGCGGGA GGCCCTGGGA 
2101 GGTCAGGCTG AGGAGTTCAG CGGCCGGGGA ACCGTAGCGA TGCTGGAAGG 
illi GACTTTTGAT GGCCATGGTA AGTCAAGGGC TGCTAGGCCC TCCGCTCACA 
2201 GCCTGCCACC ACTTACTGGT GTGTGACCTT TGCACATGGC TTAGTCCCTC 
HI} TGTTGCGTGA TCTGTCAAAT GGAGTGATAA CAGTGCCCAT CAGCCGGGTG 
2301 CAGTGGCTAG TGCCTGAAAT CCCAACACTT TGGGAGGCGG AGGTGGGTGG 
HI} ATCACTTCAG GTCAGGAGTT CGAGACCAGC CTGGCCAACA TGGTGAAACC 
2401 CTGTCTCTAC TAAAAATATA AAAATTAGCT GGGCATGGTG GTGCGTACCT 
2451 GTAATCCCAG ATACTTGGGA GGTTGAGGCA GGAGAATCGC TTGAACCCGG 
2501 GAGGCAGATG TTGCAGTGAA CCAAGACTGT GCCACTGCAC TCCAGTCTGG 
2551 GCAACAGAGT GAGCCTCCAT CTCAAACAAA CAAACAAAAA GCAGTGCCCA 
2601 TCATGTAGGA TTGAGTGATT GAGTGAGGAC TGAGCCTTGT GCAAAGTGAG 
^51 CACTCACTAA TCACCAGGTT GTAGTATCAG TGATAACCAT CAATGATCCA 
2701 GGTAAAGCCC TGAGGGTTCA GAAAGATGCC GGAGCGCTTT CAAGGTGCTG 
2751 GGGATTGGTG GGCAAGCCCT CGAATAATAG AAACAGTTCT CTGTATTACA 
2801 ACAGAAAGCA GGAGGCCCAT GCTGGGTGCT GCCAGGAACT CAGTAGTAAC 
HI} TAAGACAGCA CCGGTGCTGC TTCCCCAGCG CACCTAGGCC AGTGGGGAAA 
2901 CAGACTCACC ACACAGTCCC AGCCCAGAGT GGTCAGGGCC AAGATGGGGA 
2951 AGCACGGGGA GAAAGGTCAG GGTGGGATGG GGAGGGGTCA GGGCAAGAGG 
3001 GGTCAGGGCC AGGCTGAGGG AAGCCCTGGG ACTGTAGGAA TTTAGAGGAG 
111} GTACCTGACG CGGCATGTTT GGTGAGGGAG ATTCAGGAAG TCTTCCTGGA 
3101 AGAGAGGCTG TCGGAGCTGA GACTCATAAG ATGAGTGGGG AGGGTGTTCC 
3151 AGGCAGAAAG ACCAGCACCT ACAAAAGCAT GACTTTGAGA GAAGCATTCA 
3201 TCCATTCAAC TGATGAATTT TCAGACTGGG CACGCTGGCT CATGCCTGTA 
HI} ATCCCAGCAG TTTGGAAGGC TGAATGGGGA GGATGACTTG AGCCTAGGCA 
3301 TTTGTGACAA GCCTGGGCAA CATGGTGAGA CCCTGCCTCC ACAAAACAAA 
3351 CAAACAAACA AAAAATCATT ATACCTGGTA CCATGGGTAC CAGGTACATA 
3401 GAAATGACTC AGGCAGATAT GGTGTCCTCT CCTACTGTGG GAGAGGCGGG 
Irl} CTTATACTGC AGTAAGACAA TAGAGGGAGG GAATATAATC CTAAAATGAG 
3501 AGGTACAGAT TTGAGAGCAA ACACAGGGCA CAGGCATATG TACGAGGGTA 
3551 AAGAGGGAAT CAGGGAAGGC TTCTCAGAGA AGGTGACATT TAAGCCGGGA 
3601 CATGAAGGAT GAACGAGTTA GTTCACCAAG GATGGGATGG AAAGGGGTGA 
3651 GAGTGATGGA GGCAGAGGGA ACTGCAGGAT CATAGGCCTA GACAGGGGAT 
3701 CCTGACGCCC TTGAGGAAGT GAGAGAAGAC CAGCGCAGTC GTAGTGGGTT 
111} AAGTAACAAA GCTGAGAAGC CAGGGAAATC CCTGGTCATG CAGGGCCTGT 
3801 GAGTCACGTC AGAGTGTTTG GGCTTTTGTT TTTCCTGGGA GCAGTCGATT 
3851 TTAAGCAGGG AACAGCTGTA TTCAGAGTTG GGAAGATCCT GTGGTTGCTG 
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TCTTTTCTCT 
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NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TTCTTCCTTT 
CTCTCTTTCT 
TGGGGTGCAG 
TCAAGTGATC 
ATACCACCAT 
GTTATATTGC 
ACCTCAGCCT 
ACTCAACACT 
CCTGGCACCC 
TGGGAAGGGG 
CTTCCCCCTC 
CAGAAGAGTT 
AAGCATGAGG 
GTATCACAAG 
CACCCTTCTG 
CCCGTGGGCC 
CTCCTCACCA 
AGCTCTTCCT 
CCGTGCCCGC 
TCAACATTCC 
TCCACCACGC 
CAGGACGGTG 
CCAGAGTGTA 
CGGAGTCTGT 
GCACAACCGC 
GGTTAGCTTT 
TAGCCAGGTA 
TCTGGCTGTC 
TCACACGCCC 
CCTGGCACAG 
TGTCGCTGTC 
CTCCCAGCCT 
ACAGCTACCC 
TGAGACACGC 
GCAGCCCTCA 
CACATCTGGG 
TCAGACCGGA 
GTGGAAAGCG 
TCTACAGCCC 
GTTAGAGACC 
GCCGGATCTG 
TTTGTTTCTG 
TGAGTAAGGG 
GGAGCTGAAA 
AGATGTGGAG 
TTCCAAAAAG 
TAACCCAAGC 
CTGAAGCCAA 
GAACTAGCTT 
TTTTCTCTTG 
CTTTCTTTTT 
CCCAGGCTGG 
CCGGGTTCAA 
AGGTGTGTGC 
GGTTTCACCA 
CTGCCAGTCT 
CTAGATAAAT 
AGAAAGATAC 
AGTCATCAAC 
I I I I I I 



NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
CTTCCTTCCT 
CTCTTTCTTT 
TGGTACAAGC 
CTCCCACGTC 
GCCTGGCTAA 
CCAGGCTGGT 
CCCACGTGGC 
CCACAAATGT 
GAGACTTGTA 
TTTATAGTTT 
CTTGGCTCCA 
CAACCCAGAC 
CGTTCCTGCC 
CAGGTGCTGG 
CACCCTGGGC 
TGGGTGTGAG 
GGGAAGCGTG 
CTTCTTCACC 
CGGTACACCC 
CCCAGCCTTC 
AGACCAGATG 
CCTCCAGCCT 
CACTGCAGGC 
CCCACGGCCC 
ACACCCATAC 
CCACAGACAT 
ACCCACCAAC 
ACCTTCACAA 
TCTCCATTCA 
GGAACAGCAT 
TATGGCCCCA 
GTGACCACCG 
ACGTACGACA 
CGCCCCCACA 
GTCACCCCTT 
TCTGCGATTA 
GGAACACCTG 
GCCCCTGCTG 
CTGTTCGGCG 
AAGTAGGGGC 
ACGTTCCTTG 
GCCTTTAGCC 
GTGGGGACAG 
AACATGGCCA 
CTGGGAGGAG 
GGCCAAGGTG 
TGGACTGTGG 
CTAGATCCAG 
GCAACACAGA 
CCTCCCCTTC 
I I I I I 



mr 



TTT 
AGTGCAGTGG 
GCCATTCTCC 
CACCACGCCC 
TGTTGGCCAG 
CGGCCTCCCA 
TTCTTATCTA 
AATGAGCCCC 
TTTCTGCAGC 
I I I I IATTTT 
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GAGATAGGGT 
CATAGTTCAC 
CGGCCTCCAA 
ATTTCTTATT 
CTGGTCTCAA 
GCTGGGATTA 
TATCTATTGC 
CTCACTCTGT 
TAACCTCCGC 
TAGCTGGAAT 
GTAGTAGAGA 
GACCTCAGGT 
GTATGAGCCA 
TATTATTATT 
GGCTGGAGTG 
GGTTCAAGCG 
CATGCACCAC 
TCTCCAGGTT 
CCATCTCGGC 
TGGCCTTAAA 
ATTTCTGAGA 
TTTCAAACAT 
ACTCCTGCCT 
CGCCTGGCTA 
AGCCAGGCTG 
CTCCCAAGTG 
GTAGACTCTT 
AGAGAATGAC 
TGTGGAGACT 
GCAGCCCTGT 
ACCAACCTGG 
TTAGCCAGGC 
GAGGCAGGAG 
GATCGTGCCA 
AAAAAAAAAG 
GACAAAACTT 
TTTACTTCCC 
AGGAGGAACA 
ACAGTCATTT 
CGGCTATACC 
GCCTTTTCCA 
AGCAAGGTGG 
GCTGATCACT 
AC (SEQ ID 



TTTGCTTTGT 
TGGGGCTTCA 
GCAGCTGGGA 
TTTATTTTCT 
ATTCCAGGCC 
TAGGCGTGAG 
I I I I IATTTA 
GGCCCATGCT 
CTCTTAGGTT 
TACAGGTGCC 
TGGGGTTTTG 
GATCTGCCCA 
CCGTGCCTGG 
Al I I I I I 1AT 
CAATGGCGTG 
ATTCTCCTGC 
CATGTCCAGC 
GGTCAGGCTG 
CTCCCAAAGT 
GATTATTTTA 
GTTAAAGATA 
GCAGCAAAGT 
CAGTCTCCCA 
Al I I I I IGTA 
ATCTCAAACT 
CTGGGACTAC 
GATATTAGTT 
TAATAACAAA 
TTGAAACCCA 
GGAAAATGGT 
CCAACATGCA 
ATGGTGGTGC 
AATTGCTTGA 
CTGCACTCAG 
TCTACTTCCC 
TAACTTGTGT 
AAAGGCTGTG 
AAACCCAGGC 
ATTTGCTGTT 
CTCCTGCTCC 
CTTGCCAGGC 
CTCACGCCTA 
TGAGAATAGG 
NO: 3) 



TGCCCAAGCT 
GACTCCTAGG 
CTACAGATGC 
ATAGAGAAAG 
TCAAGAGTTT 
TCACTGCACC 
CTTATTTATC 
GGAGTGCAGT 
CAAGCAGTTC 
TGCCACCAAG 
CCATGTTGAC 
CCTTGGCCTC 
CCACCTATTG 
TTGCAGATGG 
ATCTCAGCTC 
CTCAGCATCC 
TAATTTTGTA 
GTCTCAAACT 
GCTGGGATTA 
AGGCAAATTA 
I I I I I GCCCT 
TGAGGGAATT 
AGCAGCTGGG 
M I I IAGTAG 
CCTGGCCTCA 
AGGCGTGAGC 
CTCTCTGGTT 
AACACAGAAA 
TGTATACCAT 
ACAGCAGTTA 
GAAACCCCGT 
GCACCTGTAA 
ACCCAGGAGG 
CCTGGGCAAC 
AACCTTCCCA 
TTCCGACCCC 
TTCTCTCAGC 
ACAGAATTCA 
TGCTGGGACG 
TGATGAAATA 
CTTGGGAACA 
TAATCCCAGC 
AGTTTAAGAC 



GGGGTGCAGT 
CTCAAGTGAT 
GTGCCACCAC 
TCTCACTATA 
CCATCCCAGC 
CTGCCCTAAT 
TTTTAI I I I I 
GGCATCATCT 
TCCTGCCTTG 
CCTGGCTAAT 
CAGGCTGGTC 
CCAAAGTGCT 
CI I I I IAAAG 
AGTTTCGCTA 
ACCGCAACCT 
CTAGTAGCTG 
Till IAGTAG 
CCCAACCTCA 
CAGGTGTGAG 
CAGAAAGCAA 
TGACATTTTA 
GTACAAGGAA 
ACTACAGGTG 
AGATGGAGTT 
AGTGATCCAC 
CACTGTGCCT 
GAAATGTTTT 
GTTATAAGGA 
TGGTGGGAAT 
CCTGAGGTCA 
CTCCATTAAA 
TCCCAGCTAC 
CGGAGGTTGC 
AAAGCAAGAC 
AAAATTTATC 
AGGCTTGGCT 
CCCTCTGCCT 
CCAACAAGAA 
ATGACGGTCA 
CCCTCATGTC 
GAAGTCAGGG 
ACTTTGGGAG 
CAGCCGGCCA 



AGCATGATCT 
CCTCCCGCCT 
ACCCAGCTAA 
CAGCCCTGTG 
CTCCCAAAGT 
A I I I I TATTT 
GAGACAGAGT 
CGGCTCACTG 
ACCTCCCGAG 
I I I I I IATTT 
TCGAACTCCT 
GAGATTACTG 
ATTATTTT 



TTGTTGCCCA 
CCGCCTCCCA 
GGATTACAGG 
AGACGAGGTT 
GGTGATCCGC 
CCACCGCGCC 
TTTAATGCAC 
TGAGGACAGT 
CACCTTGTGC 
CCCGTCACCA 
TCATCGTGTT 
CCGCCTCGGC 
GGCCAGGTAA 
TAAAAATGAA 
TTGATGAAGA 
GTGAAACGAC 
GGAGTTTGAA 
TGTACAAAAA 
TCGGGAGGCT 
AGTGAGCCGA 
TCTGTCTCAA 
TAAACCCCGT 
GTTCTGGACA 
GGTTTCTTTC 
CATGCTGATG 
GCACCACGGT 
CAAAGTAAGA 
TTCTAGGCTG 
GCTGAGGCGG 
ACACAGTGAA 
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Exon : 
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Exon : 


13442 


13542 


intron: 


13547 


13648 


Exon: 


13547 


13648 


Stop: 


13649 
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6385 

6813 

7853 

12973 

12973 

13012 

13072 

13370 

13682 

14631 



C 
C 
C 
G 
A 
G 
A 
C 
C 
C 



Context: 

DNA 

Position 

2226 TGACAGGGGCCATGATGGAGACACCTTGGATCGAAGAGGTCACAGCACCCTCCTCTTTCT 

I gg I g cctacccccagctgagtaagaagtacggaccggtc 
g t gg ^ gggctgtggtggtcctg gttgggcaggaggctgtgcgggaggccctgggaggtca 
ggctgaggagttcagcggccggggaaccgtagcgatgctggaagggacttttgatggcca 
tggtaagtcaagggctgctaggccctccgctcacagcctgccaccacttactggtgtgtg 

LG,C,A] 

GG TIT GGA ^T GGG ™ 
GGA T GAGCCGGGTGCAGTGGCTA ^^ 

G I GGATCAC ^ GAGGTCAGGAGTT CGAGACCAGCCTGGCCAACATGGTGAAACCCTGTCT 
G T AG I A f^ TATAAAA ^^ 

GGGAGGTTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGATGTTGCAGTGAACCAAGA 

2226 TGACAGGGGCCATGATGGAGACACCTTGGATCGAAGAGGTCACAGCACCCTCCTCTTTCT 
T£ G T? GG I5 CCCCAGCTGACT ^ 
G I G $ GGGGG T2^ 
$ GG I GA 5 GAG ^ CAGCGGCCGGGGAA ^^ 

TGGTAAGTCAAGGGCTGCTAGGCCCTCCGCTCACAGCCTGCCACCACTTACTGGTGTGTG 
H- | A , uSTJ 
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CCTTTGCACATGGCTTAGTCCCTCTGTTGCCTCATCTGTCAAATGGAGTGATAACAGTGC 
CCATCAGCCGGGTGCAGTGGCTAGTGCCTGAAATCCCAACACTTTGGGAGGCGGAGGTGG 
GTGGATCACTTGAGGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCCTGTCT 
CTACTAAAAATATAAAAATTAGCTGGGCATGGTGGTGCGTACCTGTAATCCCAGATACTT 
GGGAGGTTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGATGTTGCAGTGAACCAAGA 

AAACAGTTCTCTGTATTACAACAGAAAGCAGGAGGCCCATGCTGGGTGCTGCCAGGAACT 

CAGTAGTAACTAAGACAGCACCGGTGCTGCTTCCCCAGCGCACCTAGGCCAGTGGGGAAA 

CAGACTCACCACACAGTCCCAGCCCAGAGTGGTCAGGGCCAAGATGGGGAAGCACGGGGA 

GAAAGGTCAGGGTGGGATGGGGAGGGGTCAGGGCAAGAGGGGTCAGGGCCAGGCTGAGGG 

^GCCCTGGGACTGTAGGAATTTAGAGGAGGTACCTGACCCGGCATGTTTGGTGAGGGAG 
[A t T,GSC] 

TTCAGGAAGTCTTCCTGGAAGAGAGGCTGTCGGAGCTGAGACTCATAAGATGAGTGGGGA 
GGGTGTTCCAGGCAGAAAGACCAGCACCTACAAAAGCATGACTTTGAGAGAAGCATTCAT 
CCATTCAACTGATGAATTTTCAGACTGGGCACGCTGGCTCATGCCTGTAATCCCAGCACT 
TTGGAAGGCTGAATGGGGAGGATGACTTGAGCCTAGGCATTTGTGACAAGCCTGGGCAAC 
ATGGTGAGACCCTGCCTCCACAAAACAAACAAACAAACAAAAAATCATTATACCTGGTAC 

ATCCTAAAATGAGAGGTACAGATTTGAGAGCAAACACAGGGCACAGGCATATGTACGAGG 

GTAAAGAGGGAATCAGGGAAGGCTTCTCAGAGAAGGTGACATTTAAGCCGGGACATGAAG 

GATGAACGAGTTAGTTCACCAAGGATGGGATGGAAAGGGGTGAGAGTGATGGAGGCAGAG 

GGAACTGCAGGATCATAGGCCTAGACAGGGGATCCTGACGCCCTTGAGGAAGTGAGAGAA 

GACCAGCGCAGTCGTAGTGGGTTAAGTAACAAAGCTGAGAAGCCAGGGAAATCCCTGGTC 
[A,T] 

TG^GGGCCTGTGAGTCACGTCAGAGTGTTTGGGCTTTTGTTTTTCCTGGGAGCAGTCGA 
TTTTAAGCAGGGAACAGCTGTATTCAGAGTTGGGAAGATCCTGTGGTTGCTGCCTGAAGG 
GGATGAAACTGGAGGCTAGGAGCCCAGGGTGATAGGGAGGATCCAGGGTGATGGGGAGGC 
TGGGAGGTCCGCGGTGATGGACCAGGGCTGGGGCCAGGGGATGGGGAGGAAGGAGTAATT 
GGGAGAGGCCTGGGGCTCTGGCCGAGGAATGGATGGTGGGCTGAAACAGGGAGAGGAGAG 

ATCATAG GCCTAG AC AG GGG ATC CTG ACG C C CTTG AG G AAGTG AG AG AAG AC C AGCG CAG 

TCGTAGTGGGTTAAGTAACAAAGCTGAGAAGCCAGGGAAATCCCTGGTCATGCAGGGCCT 

GTGAGTCACGTCAGAGTGTTTGGGCTTTTGTTTTTCCTGGGAGCAGTCGATTTTAAGCAG 

GGAACAGCTGTATTCAGAGTTGGGAAGATCCTGTGGTTGCTGCCTGAAGGGGATGAAACT 

GGAGGCTAGGAGCCCAGGGTGATAGGGAGGATCCAGGGTGATGGGGAGGCTGGGAGGTCC 
LT,C, GJ 

CGGTGATGGACCAGGGCTGGGGCCAGGGGATGGGGAGGAAGGAGTAATTGGGAGAGGCCT 
GGGGCTCTGGCCGAGGAATGGATGGTGGGCTGAAACAGGGAGAGGAGAGATGCTTAGGCC 
ACTTTGGAACACAGTAGGGCAAGGACAGGAGACACCCAAGGGGAAGTGCCCAAGAGACCA 
CGACAGGCTGGCATTGGACAGGGAAGGTCTGTCTGGAGCAGGTGTCTTGGATAAGGGAGG 
AAAATGGTGCAGTTCCATCCTCCTCCCTCTCTGTTCAACCTCTAAACTACATGGGGCACA 

AGTTTGATTATGCATTTATTGAGCACCTACTGAGTCCTCATCCCTGGGCTAGGCTGGAAT 
GGACTCAGATGGAGCCTGAAGAGTCCCCCTCAGGGAACCTCACTAGAAAGAAGGAGGAAT 
CGGCCGGGCGCGGTGGCTCACGCCTGTAATCCCAACACTTTGGGAGGCTGAGGTGGGTGG 
ATCACAAGGTCAGGAGATCGAGACCATCCTGGCTAACACAGTGAAACCCCATCTCTACTA 
AAAA l ALAAAAAATGAGCCAGGCATGGTGGCGGGCGCCTGTAGTCCCAGCTACTCAGGAG 
LT,C,GJ 

CTGAGGCAGGAGAATTGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGACGAGATCACGC 
CACTGCACTCCAGCCTGGGCAACAGAGCGAGATTCCGTCTCAAAAAAAAAAAGAAAGAAA 
GGAA^GGGGGAATGGGGGAGAGGGGCCGGTCCCTTTTTGAGTCTAGCCTTCTGCGCAG 
GGGTTTTCTTCTCCAACGGGGAGCGGTGGAGGCAGCTGAGGAAGTTTACCATGCTTGCTC 
TGCGGGACCTGGGCATGGGGAAGCGAGAAGGCGAGGAGCTGATCCAGGCGGAGGCCCGGT 

CTTTGGGAGGCTGAGGTGGGTGGATCACAAGGTCAGGAGATCGAGACCATCCTGGCTAAC 
ACAGTGAAACCCCATCTCTACTAAAAATACAAAAAATGAGCCAGGCATGGTGGCGGGCGC 
CTGTAGTCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATTGCTTGAACCCGGGAGGCAGA 
GGTTGCAGTGAGACGAGATCACGCCACTGCACTCCAGCCTGGGCAACAGAGCGAGATTCC 
GTCTCAAAAAAAAAAAGAAAGAAAGGAAGAAGGGGGAATGGGGGAGAGGGGCCGGTCCCT 
[T,G,A,C] 

TTTGAGTCTAGCCTTCTGCGCAGGGGTTTTCTTCTCCAACGGGGAGCGGTGGAGGCAGCT 
GAGGAAGTTTACCATGCTTGCTCTGCGGGACCTGGGCATGGGGAAGCGAGAAGGCGAGGA 
GCTGATCCAGGCGGAGGCCCGGTGTCTGGTGGAGACATTCCAGGGGACAGAAGGTCAGCA 
TGGCGGGGTCACCCCAGGGTCTCCAGCCGAGTGAAAGGGAAAACTCTCCTACTGTGGCTG 
GGGGTGGCCCCAACCCAGGTCCTGGAATGGGCAGGAGGGGAAGCCTTGAACTCTAGGGCT 



TCCCTTTTTGAGTCTAGCCTTCTGCGCAGGGGTTTTCTTCTCCAACGGGGAGCGGTGGAG 

GCAGCTGAGGAAGTTTACCATGCTTGCTCTGCGGGACCTGGGCATGGGGAAGCGAGAAGG 

CGAGGAGCTGATCCAGGCGGAGGCCCGGTGTCTGGTGGAGACATTCCAGGGGACAGAAGG 

TCAGCATGGCGGGGTCACCCCAGGGTCTCCAGCCGAGTGAAAGGGAAAACTCTCCTACTG 

TGGCTGGGGGTGGCCCCAACCCAGGTCCTGGAATGGGCAGGAGGGGAAGCCTTGAACTCT 
LA.CT.GJ 

GGGCTGGCCTGGGGGTTCTGTTCACTGCCACCTTCTGTCTCTGTCCCACTGTCTCTCCGA 
GGCTGTCATGACATCTCT CTGT GTGTCTCTGGTGCTATCATCCCATTCTTCCTGGGTCTC 
CATCTCTCTCTCTGTCTCTTTTCTTTCTCTCTCCTTTCCTCTATTTTTTGGGCCCTCAGT 
CTATCTCTGTTTCTGTCTCCCTGTCTGTGTGATGGTCACTCTGTTTCTTTCTCCCTGTCT 
GTTTCTCTGTCCCTATCTGTCTGTATCCTTCTTTGCCTGTTTAGCTCTCTCCCTGCGCTG 
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CTGTCTCTCCGAGGCTGTCATGACATCTCTCTGTGTGTCTCTGGTGCTATCATCCCATTC 

TGGGCCCTC^GTCTATCTCTGTTTCTGTCTCCCTGTCTGTGTGATGGTCACTCTGTTTCT 

CTCCCTGCGCTGTCCATCCATCTTTCCCTGCCTCCCTGTCTCTCTCTGGTTGGGTTCAGC 
LC,G,AST] 

CCAACCTGCTCCCCTCTGCCTGGCTCCATCACAGCCTACCTCCCTGCCCCCATTCCCCCC 
^^^SS C ^^ CGATCCCTCCCTGCTGCTGGCCCAGGC CACCTCCAACGTAGTCTGCTC 
9?T G ^T$T^ GGG ^ TGGGG ^~ r ^ TG ^TATGAGGATAAGGAGTTCCAGGCCGTGGTCCGGGC 
AGCTGGTGGTACCCTGCTGGGAGTCAGCTCCCAGGGGGGTCAGGTGAGTGGGTGGGACCC 
CTCTCCAACTACCTTCCCTGAAGGTTCCTGCCAAGGTCCCATGAGAACTAGCTGCCCTTC 

G I GGG I GGGACCCCTCTCC ^^^ 

CTAGCTGCCCTTCTCCCCACAGACCTACGAGATGTTCTCCTGGTTCCTGCGGCCCCTGCC 

agg ^ cccacaagwgctcctc caccacgtcagcaccttggctgccttcacagtccggca 

GGTGCAGCAGCACCAGGGGAACCTGGATGCTTCGGGCCCCGCACGTGACCTTGTCGATGC 

9^S C I5 CTGAAGATGGCACAGGTGT GGGAAGGGTGCAGGGACCCCCTCTCTGAATGGGC 
l_C,T, GJ 

TGGTGACCTGGCAGGTCCCAGCCAGGTGTCCCTGGGGACCTCAATTGGGTTCCTCTCTCT 
TTCTCTCTCTGCATGTCTCTGTGAGTATGAGTGTCTCTGTGCATGTGTGTGCATCCCTTC 
TCTGCACATCTGTGCTGGCCCTTTCAGGGCGTTGCTCTCACTGCCTCTCCCGCCCCCGAC 
CTGGGCATTTGTGCCGGGCTGTCTGTCTCTCCAGCATCTCTCCTCTTTCTCCCTCCCACC 
TCGGCCCTTGTGTTCAGGCCCCATGCCCAGGGTCCTACACCAGCAATCCCCAGGATCACT 

^T G T GG T GGCCCmCAGGGGG ^GCTCTCACTGCCTCTCCCGCCCCCGACCTGGGCA 

TTTGTGCCGGGCTGTCTGTCTCTCCAGCATCTCTCCTCTTTCTCCCTCCCACCTCGGCCC 

TTGTGTTC AG GCC C C ATGCCC AGGGTCCT AC ACC A GC AATC C CCAGG ATC ACTTC ATC C C 

ATCCCCTGCAGCCTCCCCAGACTTTTATGTAAATTCACAATTTTATGTGMTTATGGTCA 

TyATTAGGAAGCCTTGCAATATCAAGTTATGTTAATAAAGTCCACTTTATTAATTATAT 
LC,T,AJ 

Tr^r^A^L?TI CCI ' 11 ' CTTTTCTTTTTAAAGAGACAGGATCTCTTTC 
T G ^ GGGGAGGCTAGAGTA CAGTTGCAAAA^ 

GGTG ^ GCAATCC TCCTGCGT CGGG CTCCTGAGTAGCTGGGACAACAGGTGTGCACCACC 
A $ A ^I^^ TAAA ' ' ' " 1 1 CTTTGTAGAGATAGACTCTCACTATGTTACCCAGG 
CTGGTCTTGAATTCCTGGGCTCATGTAATCCTCCTGCTGCCTTGAACTCCCAAAGTGCTG 

TCTTTAATTAA^CAGOTCCCTGTTTGTGGAGAAAGAAAAATTTTTCTT 
^CCCATpTTTTTCTCT^ 

T^CGTTTACCATCTTAACCATTTTTAGGTATACAGTAWGTAGTGTTCAGTACATTCAT 
ACTGTTATGC AATC AGTCTC C AG AACTCTTC ATGTTGCAAAG CTG AAACTCTAT AC C CAT 
TAAACAACTGCCTGTTCCTCCCTCCTCCAACCCCTGGCAATCACC I II I GAGA 

GAAGTCTCACTCTGTCACCCAGGCTAGAGTGCGGTGGCTCGATCTCGGCTCACTGCAAGC 

IS GGG I GGGGGG ™ 

GTCCCCGTCACCACGCCTGGCTAATTTTTTGTATTTTTAGTAGAGATGGAGTT^ 

G IJ AGGGAGGG ^ GA ^ G ^ GAAAG ^ GG TGGCCTCAAGTGATCCACCCGCCTCGGCCTCCCAA 

rtoTvaCT^GvjACTACAGGCGTGAGCCACTGTGCCTGGCCAGGAAGTAGACTCTTGATATTA 

GGG T GGAG T GG T A ^ 

C^CGTCAGCCTCCTGAGCAGCTGGGACAACGGGCTCATACCACCATGCCTGGCTAATT 

GGCTCAMTGCTTCTCTCACCTCAGCCTCCCACGTGGCTGGGATTACAGGCATGAGCCAC 
LG , C , TJ 

G ^GCCACTC^CTC^ 

GGGAG ^ GAGAG ^ G TACTCCACACTCGAGGACCAAATAGACTGGGGTGGGAAGGGGTTT 
ATAGTTTCATTATTATTTCCCCTCAGGGCACGGAGGTCTTCCCCCTCCTTGGCTCCATCC 
TGCATGACCCCAACATCTTCAAGCACCCAGAAGAGTTCAACCCAGACCGTTTCCTGGATG 
CAGATGGACGGTTCAGGAAGCATGAGGCGTTCCTGCCCTTCTCCTTAGGTATCTGCTGCA 
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CTGTCTTCCTCTCTTTCTCTCTTTCTTTCTTGACAGGGTCTCTCTTTGTCTCCCAGGCTG 

GGGTGCAGTGGTACAAGCATAGCTCACAGCAGCCTTGAACTCCTAGGCTCAAGTGATCCT 

CCCA CG TCAGC CTCCTGAGCAGCTGGGACAACGGGCTCATACCACCATGCCTGGCTAATT 

TTTTAATTTTTCGTAGAGACAAGGTCTTGTTATATTGCCCAGGCTGGTCTCAAACTCCTG 

GGCTCAAATGCTTCTCTCACCTCAGCCTCCCACGTGGCTGGGATTACAGGCATGAGCCAC 
[A,G,TJ 

GCACGCCACTCAACACTCCACAAATGTTGATGCCATTATGTTTTGTGAACTAGTGTCCCT 
GGCACCCGAGACTTGTACTCCACACTCGAGGACCAAATAGACTGGGGTGGGAAGGGGTTT 
ATAGTTTCATTATTATTTCCCCTCAGGGCACGGAGGTCTTCCCCCTCCTTGGCTCCATCC 
TGCATGACCCCAACATCTTCAAGCACCCAGAAGAGTTCAACCCAGACCGTTTCCTGGATG 
CAGATGGACGGTTCAGGAAGCATGAGGCGTTCCTGCCCTTCTCCTTAGGTATCTGCTGCA 



CTCTCTTTGTCTCCCAGGCTGGGGTGCAGTGGTACAAGCATAGCTCACAGCAGCCTTGAA 
CTCCTAGGCTCAAGTGATCCTCCCACGTCAGCCTCCTGAGCAGCTGGGACAACGGGCTCA 
TACCACCATGCCTGGCTAATTTTTTAATTTTTCGTAGAGACAAGGTCTTGTTATATTGCC 
CAGGCTGGTCTCAAACTCCTGGGCTCAAATGCTTCTCTCACCTCAGCCTCCCACGTGGCT 
GGGATTACAGGCATGAGCCACTGCACGCCACTCAACACTCCACAAATGTTGATGCCATTA 
[ G,A, CsT] 

GTTTTGTGAACTAGTGTCCCTGGCACCCGAGACTTGTACTCCACACTCGAGGACCAAATA 
GACTGGGGTGGGAAGGGGTTTATAGTTTCATTATTATTTCCCCTCAGGGCACGGAGGTCT 
TCCCCCTCCTTGGCTCCATCCTGCATGACCCCAACATCTTCAAGCACCCAGAAGAGTTCA 
ACCCAGACCGTTTCCTGGATGCAGATGGACGGTTCAGGAAGCATGAGGCGTTCCTGCCCT 
TCTCCTTAGGTATCTGCTGCAGCCCTGGGTATCACAAGCAGGTGCTGGCGAACTCCAGGC 

CTCCTAGGCTCAAGTGATCCTCCCACGTCAGCCTCCTGAGCAGCTGGGACAACGGGCTCA 
TACCACCATGCCTGGCTAATTTTTTAATTTTTCGTAGAGACAAGGTCTTGTTATATTGCC 
CAGGCTGGTCTCAAACTCCTGGGCTCAAATGCTTCTCTCACCTCAGCCTCCCACGTGGCT 
GGGATTACAGGCATGAGCCACTGCACGCCACTCAACACTCCACAAATGTTGATGCCATTA 
TGTTTTGTGAACTAGTGTCCCTGGCACCCGAGACTTGTACTCCACACTCGAGGACCAAAT 
[A,T,CSG] 

GACTGGGGTGGGAAGGGGTTTATAGTTTCATTATTATTTCCCCTCAGGGCACGGAGGTCT 
TCCCCCTCCTTGGCTCCATCCTGCATGACCCCAACATCTTCAAGCACCCAGAAGAGTTCA 
ACCCAGACCGTTTCCTGGATGCAGATGGACGGTTCAGGAAGCATGAGGCGTTCCTGCCCT 
TCTCCTTAGGTATCTGCTGCAGCCCTGGGTATCACAAGCAGGTGCTGGCGAACTCCAGGC 
ATCTGTGCCAGCTGGGGGCACCCTTCTGCACCCTGGGCTTACTGTTGGCTCCTCCACCTG 

ATAGACTGGGGTGGGAAGGGGTTTATAGTTTCATTATTATTTCCCCTCAGGGCACGGAGG 
TCTTCC CCCTCCTTGG CTC CATC CTGC ATG AC CC C AACATCTTC AAGC AC C CAG AAGAGT 
TCAACCCAGACCGTTTCCTGGATGCAGATGGACGGTTCAGGAAGCATGAGGCGTTCCTGC 
CCTTCTCCTTAGGTATCTGCTGCAGCCCTGGGTATCACAAGCAGGTGCTGGCGAACTCCA 

GGCATCTGTGCCAGCTGGGGGCACCCTTCTGCACCCTGGGCTTACTGTTGGCTCCTCCAC 
[C,G] 

TGCTGTTCCCCCCGTGGGCCTGGGTGTGAGGAATACTGACTCAGCCCTCTCTCTCTCTCT 
CTCCTCACCAGGGAAGCGTGTCTGCCTTGGAGAGGGCCTGGCAAAAGCGGAGCTCTTCCT 
CTTCTTCACCACCATCCTACAAGCCTTCTCCCTGGAGAGCCCGTGCCCGCCGGTACACCC 
iGAGLLTCAAGCCCACCGTCAGTGGCCTTTTCAACATTCCCCCAGCCTTCCAGCTGCAAG 
TCCGTCCCACTGACCTTCACTCCACCACGCAGACCAGATGAAGGAAGGCAACTTGGAAGT 

CCGTGGGCCTGGGTGTGAGGAATACTGACTCAGCCCTCTCTCTCTCTCTCTCCTCACCAG 
GGAAGCGTGTCTGCCTTGGAGAGGGCCTGGCAAAAGCGGAGCTCTTCCTCTTCTTCACCA 
CCATCCTACAAGCCTTCTCCCTGGAGAGCCCGTGCCCGCCGGTACACCCTGAGCCTCAAG 
CCCACCGTCAGTGGCCTTTTCAACATTCCCCCAGCCTTCCAGCTGCAAGTCCGTCCCACT 
GACCTTCACTC C AC C ACGCAG ACCAG ATG AAG G AAGGC AACTTGG AAGTGGTG G GTG CC C 
[C,T,AJ 

GGACGGTGCCTCCAGCCTCAACAGTGGGCATGGACAGGGTTAATGTCTCCAGAGTGTACA 
CTGCAGGCAGCCACATTTACACGCCTGCAGTTGTTTTCCGGAGTCTGTCCCACGGCCCAC 
ACGCTCACTTGACTCATGCTGCTAAGATGCACAACCGCACACCCATACACAACTACAAGG 
GCCACAAAGCAACTGCTGGGTTAGCTTTCCACAGACATAAATATAGTCCATCTGCAATCA 
CAAGCACATAGCCAGGTAACCCACCAACTCCCCTGGATCTGCAGCCCACACGTGGGAGTC 
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^AGCCCTCAGTCACCCCTTTTT^^ 

TCTGCGATTATGCACAGAGACTTTGGACATACGAGGACCCTCAGACCGGAGGAACACCTG 

CCCAACCCCAACACGTGCTTATGTAACCACGTGGAAAGCGGCCCCTGCTGCCCCTCCACA 

CACACATACACACTCACTGATCTACAGCCCCTGTTCGGCGTCAGAGTCCCCACTAGACCC 

AGTGGAAGGGGTTAGAGACCAAGTAGGGGCCAGTTTCCAATTCACCCTGTCAGGGAGTGA 
LC,T,GJ 

CCGGATCTGACGTTCCTTGTGACTTAAGGGTCCGGCTTGGGMTTAAAGTTTGTTTCTGG 
CCTTTAGCCTACTGCGTGTGTGACCCGTGTCAGTCACTGTGAGTAAGGGGTGGGGACAGG 
GGAGTCCACCCCTCCCCTGAGGCTGGGCGGGAGCTGAAAAACATGGCCACCGCCCACCCT 
GGCTGTTGACATCAGGACCAGATGTGGAGCTGGGAGGAGGGGCAGGGCTGGTGACGCCCT 
GGGCCTCATTTCCAAAAAGGGCCAAGGTGTCCGGCGGTGGGAAGTGGGCAAGGAGGGGGT 



Chromosomal Map Position 
ePCR to dbSTS 

Site (bases) Marker chr. 
15155.. 15288 stSG46708 19 



Organism 
Homo sapiens 
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